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C"^ I Abstract Phase steps are an important type of wavefront aberrations generated by large telescopes with seg- 

mented mirrors. In a closed-loop correction cycle these phase steps have to be measured with the highest possible 
Y^ I precision using natural reference stars, that is with a small number of photons. In this paper the classical Fisher 

information of statistics is used for calculating the Cramer- Rao bound, which determines the limit to the precision 
with which the height of the steps can be estimated in an unbiased fashion with a given number of photons and 
Q '. a given measuring device. Four types of measurement devices are discussed: a Shack-Hartmann sensor with one 

jy! I small cylindrical lenslet covering a sub-aperture centred over a border, a modified Mach-Zehnder interferometer, 

C^ . a Foucault test, and a curvature sensor. The Cramer-Rao bound is calculated for all sensors under ideal con- 

jy-j ' ditions, that is narrowband measurements without additional noise or disturbances apart from the photon shot 

^->. noise. This limit is compared with the ultimate quantum statistical limit for the estimate of such a step which 

i-C ' is independent of the measuring device. For the Shack-Hartmann sensor, the effects on the Cramer-Rao bound 

of broadband measurements, finite sampling, and disturbances such as atmospheric seeing and detector readout 
noise are also investigated. The methods presented here can be used to compare the precision limits of various 
devices for measuring differential segment phases and for optimising the parameters of the devices. Under ideal 
conditions the Shack-Hartmann and the Foucault devices nearly attain the ultimate quantum statistical limits, 
whereas the Mach-Zehnder and the curvature devices each require approximately twenty times as many photons 
^~^ ' in order to reach the same precision. 
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'^ ; 1 Introduction 

• 1—1 

(-») I of such a telescope the wavefront errors introduced by segmentation have to be reduced to the order of a few 

tens of nanometers. If the segments are regarded as rigid bodies, the wavefront errors can only be generated by 
tip-tilt errors of individual segments or differential axial displacements. This paper assumes that the segments are 
^^ I already perfectly aligned in tip and tilt. A correction of remaining piston errors - from now on called differential 

Vj '_ segment phases - requires measurements of the heights of the wavefront steps at the borders of adjacent segments 

with a precision of a few nanometers, and the suppression of the relative segment phases by appropriate rigid 
body movements of the segments. 

Several methods are used or have been proposed for the measurement of differential segment phases in seg- 
mented mirrors. One method is based on the Shack-Hartmann technique and applied in a slightly modified 
fashion at the Keck telescope [1][2][3]. In an image of the segmented mirror a circular lens is placed over a 
sub-aperture centred on an intersegment border. The phase step across the border is estimated from the form 
of the diffraction pattern, for narrowband and broadband measurements and in the presence of atmospheric 
disturbances. Other methods proposed in the literature use spatial filtering in a modified Mach-Zehnder interfer- 
ometer [4] , measurements in defocussed images and pupils [5] [6] [7] [8] , phase filtering techniques [9] , or a knife 
edge technique in the pyramid wavefront sensor [10][11][12][13]. 
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In future extremely large telescopes at least one of the mirrors will probably be segmented. During the operation 



All the sensors discussed in this paper generate diffraction patterns in the plane of a detector. These can 
be regarded as likelihood functions p{x \ 9) describing the "likelihood" to register a photon on the detector as a 
function of the spatial variable x on the detector, given a certain value of a parameter, in this paper the relative 
segment phase 6. There are various possibilities to define, based on the derivatives of the p{x \ 9) with respect to 
0, an information content in the likelihood functionOne of them is the classical Fisher information [14]: 

^ Axp{x\e)^ ^^ j . (1) 

Estimating the differential segment phases is particularly difficult at low light levels, e.g. when only the light 
of faint stars is available for the measurements. The question arises what precision can ultimately be obtained 
with a given number of detected photons. In statistics, precision is characterized by the conditional variance of 
a quantity. For a parameter of interest, in this paper the relative segment phase 9, the minimum variance of an 
unbiased estimate of the parameter, the so-called Cramer- Rao minimum variance bound cr^j^g [14], is given by 
the inverse of the classical Fisher information If{9): 

<^crbW = 1/^fW . (2) 

In the following the shorter expression Cramer-Rao bound will be used for the square root ctcrb of the Cramer- 
Rao minimum variance bound. 

Two of the assumptions in the derivation of equation (2) are, unless it is obvious, checked for all cases 
investigated in this paper. First, the integral of the likelihood function over the interval for which it is defined 
must not depend on 9, at least for a given value of 9 for which the Fisher information is calculated. Second, the 
likelihood function must be regular with respect to 9. Under the first assumption the likelihood function can be 
normalised: 

dxp(x\9)^l. (3) 

This is equivalent to the assumption that one photon passes through the aperture. 
If q{x I 9) denotes the real-valued likelihood amplitude, defined by 

q{x I 9) = VpR^, (4) 

the integrand in equation (1), which we will call the Fisher information density ip{x \ 9), can be written as 



dq{x I 9) 
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*F(:^,^)=4^^H7r^ . (5) 



In principle there is an ambiguity related to the definition of the Fisher information density. This is due the 
identity 



d-(^^^^^^) =-/ d.,(x|^)l^^^^ (6) 



which can be obtained by differentiating equation (3) twice with respect to 9. The Fisher information defined as 

(7) 
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would be independent of the choice of A and B provided that A + B = 1. However, the term following the 
coefficient B can be negative, possibly giving rise to a negative Fisher information density. Since this seems 
unreasonable, B is set to zero and the Fisher information density is defined as in equation (5). The Fisher 
information will therefore be computed as 

-^ /a^y^ (8) 



According to equation (2) the computation of the Fisher information offers a possibihty to calculate analytically 
the potential precision limits of various measurement methods. These limits can then be compared with the 
ultimate limit which is independent of any measuring device. The latter is calculated in section 3 from first 
principles by introducing the wave function describing the wavefront immediately after the reflection by the 
mirror, generating a relative segment phase, into the quantum statistical counterpart of the expressions (1) or 
(8) for the Fisher information. 

Section 4 discusses the Cramer-Rao bounds for the Shack-Hartmann, the Mach-Zehnder, the Foucault and 
the curvature sensor under ideal conditions, that is for monochromatic light and without any effects introducing 
additional noise or light losses or aberrations. The effects of broadband measurements, finite sampling, and 
external disturbances such as seeing and detector readout noise can also be included in the calculations. The 
methods are outlined in detail only for the Shack-Hartmann sensor in section 6, but could readily be applied to 
the other sensors as well. 

In general, the limits given by the Cramer-Rao bound can only be reached if efficient estimators are available 
for the analysis of the data. This issue will, however, not be discussed in this paper. Furthermore, all calculations 
will be carried out for the one-dimensional case. 



2 Definition of the wavefront error 

Let, as shown in figure 1, the pupil be defined by the interval [— a, +a] with the spatial variable in the pupil 
denoted by Xp. An out-of- phase segment with its centre at c generates a constant wavefront error or phase 
difference (p{xp) = 9 over its full width 2w compared to a phase, also constant, over the rest of the pupil. There 
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Figure 1: Wavefront phase if witli a relative segment phase 9 generated by a segment of width 2w and a centre at c 
within a subaperture of width 2a. 

are two wavefront discontinuities, that is phase steps of height 6, at the segment borders. Apart from an irrelevant 
average phase, the complex amplitude describing the wavefront is given by 

C/s(xp|0) = -^e^'=^(^p)T4^(a;p,-a,+a) , (9) 

V2a 



with 



—a < a^p < c — w 



(/3(xp) = •^ 9 for c — w<Xp<c + w ^ (10) 

and 



c + w < Xp < a 



) for Xp < —a 
W(xp,-a,+a) = ■l^ 1 for —a<Xp<+a '^ , (11) 

) for +a < Xp 

where k = 2tt/X is the wavenumber and A is the wavelength of the light. The restriction of the definition to 
the interval [—a, +a] signifies that only the photons passing through the aperture of width 2a will be used for 
estimating the relative segment phase param, and the factor in front of the exponential in equation (9) normalizes 
the wave function such that the number of photons within this sub-aperture is equal to 1 , that is 

dxpU:{xp\9)Usixp\9)^l , (12) 



where U* is the complex conjugate of Us- Apart from the general or two-border configuration shown in figure 1, 
two special configurations will be discussed in more detail. One is given if the segment is located at the edge 
of the aperture [—a, +a], that is if c = a — w. This configuration will subsequently be called the single-border 
configuration. The second special configuration, which is a special case of the single-border configuration, is 
given if the border is at the centre of the aperture [—a, +a\, that is if c = w = a/2. This configuration, which is 
of particular interest for the Shack-Hartmann type sensor, will be called the centred single-border configuration. 
For the general configuration the parameter 9 will be referred to as the relative segment phase, and for the 
single-border configurations as the phase step. 



Quantum statistical precision limit for the estimate of a relative 
segment phase 



The ultimate precision limit for the measurement of differential segment phases can, if at all, be obtained with a 
measuring device which is capable of transferring the maximum relevant information contained in the source to 
the data. In addition, one needs an appropriate, that is efficient data analysis procedure. 

As stated in the introduction, the limit on the data side can be obtained from the Fisher information which 
is derived from a classical, real-valued likelihood function. However, on the source side, immediately after the 
rcfiection of an incoming monochromatic plane wavcfront by the mirror configuration shown in figure 1, the 
full information is contained in a complex quantum statistical wave function 'ip{x\9), which is equivalent to the 
complex amplitude function (9) familiar from optical theory. Therefore, the calculation of the information in the 
source requires the quantum statistical equivalent of the expression (8) for the Fisher information. For a pure 
state ip and its complex conjugate ip* this is given [15] by 
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(13) 



The first term on the right hand side is similar to the definition of the classical Fisher information density in 
equation (8). The second term, in effect, suppresses a dependence of /qm on an undetectable average phase 
across the full aperture [— a, -|-a]. 

By replacing the complex wave function ip in equation (13) by the complex amplitude Ug of equation (9) one 
obtains the quantum statistical equivalent of the Fisher information about the relative segment phase contained 
in the complex amplitude immediately after the reflection by the mirrors: 



/QM(^)=4fc2 



1- 



(14) 



The quadratic dependence of the quantum statistical information about the segment width, which is independent 
of the segment position, is shown in figure 13 by the solid line. The maximum 



-fQM,max(^) — k 

is obtained if the segment covers half of the aperture, that is for w = a/2. 



(15) 



For a single photon, the lower bound for the root-mean-square (RMS) of the uncertainty in the estimate of 
the parameter 9, that is the quantum statistical equivalent of the square of the classical Cramer-Rao minimum 
variance bound, is given by 

CTQM,CRB4(6') = —. I , =^ ■ (16) 



2fc \/vj(a — w) 

The minimum value is obtained, if the segment covers half of the pupil: 
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(17) 



It depends only on the wavelength of the light, and is of the order of 100 nm for visible wavelengths. For A^phot 
registered photons the Fisher information increases by a factor of Aphot, the quantum statistical Cramer- Rao 



bound decreases by l/y^iVphot, and the lower bound for the RMS of the uncertainty in the estimate of the 
parameter 9 is correspondingly given by 

0"QM,CRB,min(^) = —, ^^ TT ' (1^) 

This fundamental limit does not refer to a particular measurement device or a particular data analysis procedure. 
It is based on the maximum information that can potentially be extracted from the wavefront, without necessarily 
implying that such an cfhcicnt measurement process exists. 



4 Cramer- Rao bounds for the measurement of piston steps with dif- 
ferent sensors under ideal conditions 

The Cramer-Rao bounds for measuring the relative segment phase or phase step 9 are calculated for the four 
sensors mentioned above, under ideal conditions, that is under the following assumptions: light from a quasi- 
monochromatic source, an infinitely fine sampling, and, apart from the inevitable photon shot noise, no additional 
noise introduced by atmospheric effects or the readout characteristics of the detector. 

The phasing in a segmented mirror is done in two steps. First, after the installation of new segments, initial 
large differential segment phases of the order of possibly several wavelengths have to be reduced to steps of the 
order of a fraction of a wavelength. Usually, this only needs to be performed once and can therefore be done with 
bright stars. Since they supply a large number of detectable photons, high accuracies can be obtained during this 
process. Second, during observations, one has to be content with small number of photons, but has to correct 
only comparatively small differential segment phases. The precision limits or Cramer-Rao bounds therefore need 
to be calculated only for the case of very small differential segment phases or phase steps, that is for 9^0. 



4.1 Shack-Hartmann method 

Usually, the Shack-Hartmann method used for phasing measurements consists of placing a circular lens across 
the border between two segments. The information on the phase step is contained in the diffraction pattern of 
the lens. In a one-dimensional setting, a cylindrical lens is used with its axis being parallel to the border, and its 
center ideally exactly on the border. The normal configurations for the Shack-Hartmann sensor are therefore the 
single-border and the centred single-border configuration. However, to be able to compare the Shack-Hartmann 
sensor with the other phasing wavefront sensors, the computation of the likelihood function in the focal plane and 
the Fisher information, presented in appendix Al, are done also for the general configuration shown in figure 1, 
that is for a single lens covering the full aperture with one out-of-phase segment in an arbitrary location. The 
likelihood function p(^ | 9) and the Fisher information {^{^ \ 9) are both functions of the phase step 9, and a is 
a dimensionless position variable ^, which is related to a true position variable x in the focal or detector plane 
by C = 2;//, where / is the focal length of the lens. The general as well as the special configurations will be 
discussed in this section. 

Section 4.1.1 discusses the likelihood function, the Fisher information density, and the Fisher information. A 
variation of the parameter 9 causes a shift of the maximum of the diffraction pattern as well as a change of its 
form. From both characteristics partial information about 9 can be obtained. Section 4.1.2 shows that most of 
the information is contained in the shift of the maximum, and section 4.1.3 that a quarter of the full information 
is contained in the change of the form, that is, independent of the horizontal position of the likelihood function. 



4.1.1 Likelihood function, Fisher information density and Fisher information 

Figure 2a shows for the centred single-border configuration the diffraction patterns, or equivalently the likelihood 
functions p{^ \ 9), for four different phase steps 9 ranging from zero to A/2. The location of the first zero in the 




Figure 2: Shack-Hartmann sensor: (a) Likelihood functions for four differential segment phases S as a function of a the 
normalised angular coordinate in the focal plane of the Shack-Hartmann lens, (b) Likelihood function and corresponding 
functions related to the Fisher information for 9 — 0: normalised likelihood function p(^ | ^)^o (dotted), normalised Fisher 
information density iF(^ | 6)^0 /k^ (solid), integrated likelihood function and dashed : integrated Fisher information density 
divided by k^ )dashed-dotted). 

diffraction pattern for = will be denoted by ^0 = A/ (2a). For the centred single-border configuration and for 
arbitrary d the Fisher information density and the likelihood function are related by 



i^{i\d) = ev{i\d-\i2) 



(19) 



Figure 2b shows that for the case = the first maxima of the Fisher information density on both sides of 
the origin are lying just inside the first minima of the likelihood function, represented by the dotted curve. The 
dashed line in figure 2b shows the Fisher information density integrated from ^ = to the ^ value on the abszissa 
and divided by fc^, and the dashed-dotted line the corresponding integrated likelihood function. The likelihood 
function is more concentrated around the origin than the corresponding Fisher information density. For the case 
Q = A/2 the likelihood function has a minimum at ^ = and the Fisher information density consequently its 
maximum at ^ = 0. Therefore, for 6 = A/2, the Fisher information density is more concentrated around the 
origin than the corresponding likelihood function. 

The Fisher information is obtained by integrating the Fisher information density from ^ = — cx3 to ^ = -|-oo. 
For the general configuration in the limiting case 0^0 the integration of the Fisher information density given 
in equation (A- 5) can be done analytically and yields 



/f,sh = k -min(w, |c|) . 
a 



(20) 



Since the maximum of min(w, |c|) equals a/2, the Fisher information in equation (20) is always bounded by the 
quantum statistical information given in equation (15). For the special case of the single-border configuration 
this can be seen in figure 13a where the dashed line shows the linear dependence of the Fisher information for 
the Shack-Hartmann sensor on the segment width 2w. 

For the general configuration and for a given w = 0.1a, figure 13b shows the Fisher information for the 
Shack-Hartmann sensor as a function of the location of the segment centre. Surprisingly the Fisher information 
is zero for a segment in the centre of the aperture. 

The Fisher information for the Shack-Hartmann sensor attains its maximum for the centred single-border 
configuration, that is for w — \c\ — a/2: 

= k' , 



/i 



F,SH,max 



(21) 



and is identical to the quantum statistical Fisher information. Therefore, if all of the relevant information about 
the relative segment phase in the diffraction pattern could be extracted with an efficient estimator, one could 
reach the maximum possible precision, that is the fundamental bound (15) for the estimate of the phase step 6. 



The formulae given above have been derived for one lens covering the full aperture [— a, +a]. However, in a 
phasing wavefront sensor, a single lenslet will only cover a subaperture across a border. For these lenslets the 
configuration will ideally be the centred single-border configuration. It can readily be verified that due to the 
linear dependence of the Fisher information on the segment width, the information /p obtained with a single 
lens across the full aperture is identical to the information obtained with lenslets across the borders and centred 
on the borders, provided that the lenslets cover the full segment width and the segment centre is not too close 
to the centre of the aperture. Since the other sensors work over the full aperture, the Fisher information for the 
Shack-Hartmann method can therefore be compared with the other methods by using the results for a single lens 
covering the full aperture. 

4.1.2 Information contained in the shift of the maximum of the likelihood function 

Figure 2b, applying to the centred single-border configuration, shows that in the most interesting case of <C A 
nearly 50% of the Fisher information is contained in the interval between the first zeroes of the likelihood function 
on both sides of the origin. The maximum of the likelihood function is well defined, even in the case of broadband 
measurements or in the presence of atmospheric disturbances as shown in sections 6.1 and 6.2. Furthermore, 
the position of the maximum depends on the phase step 0. In addition, the position of the maximum is not too 
sensitive to small misalignments between the centre of the lens and the border between segments. For small phase 
steps 9 and small deviations from the centred single-border configuration, described by the difference c — a/2, 
that is for fc0 <C 1 and k{c — a/2) <C 1, one obtains for the location ^max(^) of the maximum of the likelihood 
function 

,_(,). -I ^l-4<£^^) . (22) 

4 a a^ 

This equation shows that the location of the maximum is only weakly affected by the misalignment c — a/2 
provided that c — a/2 <C a. A similar expression for ^niax(^) without the effect of the decentring has been given 
in [2] for circular lenslets. 

For all these reasons, it is of interest to estimate the phase step 9 only from the core of the likelihood function 
which is defined here as the interval where the likelihood function exceeds 20% of its maximum value at the 
centre of the core. Figure 2b shows that for the centred single-border configuration the integral of the Fisher 
information density over that interval is approximately O.S/p.sH.max, that is approximately 30% of the maximum 
information can be extracted from the core. 

The central core can quite accurately be fitted by a Gaussian 

Pgauss(e I 0) = ^ ,-[0.6.a«-3./(4a))]^ (23) 

Its Fisher information is given by 

/p^gauss =0.4/c2ss0.4/F^SH,max ■ (24) 

This is only marginally larger than the information content 0.3/F,sH,max in the core in the mentioned above. The 
reason for the small difference is probably that the integration of the Gaussian probability function extends from 
— oo to +00, whereas the one of the diffraction pattern only over the central core. 

The shift ^mean of thc mean position, i.e. the centre of gravity of the likelihood function, is not identical to 
the shift of the central maximum of the diffraction pattern, ^moan should be smaller than ^max since the shift of 
Cmax to one side is accompanied by an increase of the first peak on the opposite side. For the centred single lens 
configuration one obtains from equation (A-4) for the centre of gravity 

Cmean = - 771— sin(A:6') . (25) 

For k9 <^1 this becomes 

n 

Smean ^- 7^ ■ l^'-'j 

la 
A comparison with equation (22) shows that this is indeed smaller than the shift ^max of the maximum by a 
factor of 1.5. It is equal to half of the average tilt of the wavefront (p{x-p) over the aperture in the pupil. 



4.1.3 Information contained in the form of the likeUhood function 

In sonic applications a reference pattern generated by the Shack-Hartmann lenslet array may not be available. 
Furthermore, an adaptive optics system may partially correct a phase step, in the simplest case by introducing 
a constant tilt over a subaperture covered by a lenslet. Such a tilt, the amount of which may not be known, will 
lead to a shift of the diffraction pattern. Under such circumstances no information can be obtained from the 
position of the diffraction pattern, and the parameter 6 can only be estimated from the changes in the form of 
the likelihood function. 

The Fisher information contained in the form, from now on called the form-only Fisher information, can be 
calculated as follows. The likelihood function for a given phase step 9 can be shifted such that its maximum is 
closer to the origin. The Fisher information can then be calculated as a function of this shift, with the information 
contained in the form being defined as the minimum of the Fisher information as a function of the shift. For the 
centred single-border configuration a likelihood function p^ shifted by a9 /a is described by 
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[sin(7rC/Co - a2TTe/\ + 27r6'/A) - sin(27r6'/A)]' 



(27) 



Figure 3a shows for a few values of a the Fisher information densities for 6 = 0. For arbitrary values of 6 the 
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Figure 3: Shack-Hartmann sensor: (a) Normalised Fislier information density %¥(£, \ 0)^0/ k compensated by various 
sliifts of a6/a for the limit 6 —f 0. The values for a are given in the figure, (b) Normalised Fisher information density in 
the form of the likelihood function for four phase steps 0. 

minimum Fisher information is analytically obtained for a = 3/4. According to equation (22), for 9 ^ X such 
a shift is equivalent to moving the maximum of the likelihood function back to ^ = 0. Figure 3b shows the 
form-only Fisher information densities for four phase steps ranging from 9 = to 9 = A/2. A comparison with 
figure 2b shows that for 9 = the peaks of the Fisher information densities are much lower than in the case of 
the full information and are further from the origin. 

For all values of 9 the minimum Fisher information in the centred single-border configuration is given by 

/F,SH,form - fcV4 • (28) 

Therefore, if an estimate of the phase step 9 can only be based on the form of the likelihood function, one needs 
four times the number of photons to reach the same precision as is obtainable from the full information, which 
includes the information from the position. 



4.2 Mach-Zehnder interferometer 



The principle of the Mach-Zehnder interferometer as used for the detection of differential segment phases [4] is 
shown in figure 4. The incoming light is focussed and split into two beams. One of them, which serves as the 
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Figure 4: Mach-Zehnder sensor: Principle of a Mach-Zehnder interferometer as used for the detection of differential 
segment phases. 



reference beam, contains a spatial filter in the focal plane and a phase plate introducing a phase shift (f>. The 
light from the two beams is then recombined behind a second beam splitter, coUimated and registered on two 
detectors in planes which are conjugated to the entrance pupil. The spatial filter is either a pinhole with a sharp 
edge or with a Gaussian transmission function with 100% transmission at the centre. If ^ is an angular coordinate 
in the focal plane, the transmission functions of the filters are given by 



m 



1 for |e|<C 
for lei > C 

exp(-eV(2C')) 



Sharp — edge 
Gaussian 



(29) 



where C, also in angular coordinates, is either half of the diameter of the sharp pinhole or the RMS of the 
Gaussian transmission function. The computation of the likelihood function and the Fisher information density 
is outlined in appendix A2. For a sharp-edge pinhole and = A/10, figure 5a shows the likelihood functions 
for four different half-diameters C of the sharp-edge pinhole, expressed as fractions or multiples of ^o — A/ (2a), 
and figure 5b the corresponding Fisher information densities. Similar curves for a Gaussian pinhole are shown in 
figures 6a and 6b. 

Obviously, the Fisher information is maximised for some intermediate values of C. On the one hand, for very 
small pinhole widths the form of the likelihood functions resembles closely the form of the phase in the wavefront 
after the reflection by the mirrors. However, the Fisher information is small, since only a small fraction of the 
light entering the second beam interferes with the light from the reference beam. In the limit C ^ one has 
If.mz oc C^- Oil the other hand, for large pinholes the form in the interference pattern and the Fisher information 
density are concentrated around the borders between the segments. In the limit of very large pinhole sizes the 
two beams are, apart from a constant phase shift, identical and the information content goes to zero as 1/C for 
C, ^ oo. The maximum Fisher information must therefore be obtained for an intermediate size of the pinhole. 
For a segment half-width oi w ~ 0.2a this maximum is obtained for C « 0.8^o in the case of a sharp pinhole 
and for ( « 0.4^o in the case of a Gaussian pinhole, indicated by the dashed-dotted lines in the figures 5 and 6, 
respectively. 

For the centred single-border configuration the maxima can be obtained from figure 7a which shows the 
normalised sum /p/fc^ of the Fisher information in the two beams as a function of the ratio of C to ^o ■ In this 
case the maximum of the Fisher information is obtained for ( « 0.465^o in the case of a sharp-edge pinhole and 
C « 0.235^0 in the case of Gaussian pinhole. Figure 7b shows that these sizes of the pinholes maximising the 
Fisher information are for both types of pinholes comparable to the size of the core of the image in the focal 
plane. However, even under ideal conditions the maximum Fisher information is only approximately 12% of the 
quantum statistical Fisher information in the case of the sharp-edge pinhole and 9% in the case of the Gaussian 
pinhole. 




Figure 5: Mach-Zehnder sensor: (a) Likelihood functions generated by a Mach-Zehnder interferometer with sharp 
edge pinholes with radii of 0.1 (dotted), 0.8 (dashed-dotted), 4 (dashed) and 10 times A/(2a) (solid), for a relative segment 
phase 9 — A/10, (b) Corresponding normalised Fisher information densities ip/k^ for 9 = 0. 
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Figure 6: Mach-Zehnder sensor: (a) Likelihood functions generated by a Mach-Zehnder interferometer with Gaussian 
pinholes with RMS values of 0.1 (dotted), 0.4 (dashed-dotted), 2 (dashed) and 5 times A/(2a) (solid), for a relative segment 
phase 9 = A/10, (b) Corresponding normalized Fisher information densities lY/k^ for 9 = 0. 



For the single-border configuration and a width of the pinhole which maximizes the Fisher information the 
dotted hne in figure 13a shows that /f.mz(^) depends quadratically on the segment width, and figure 13b shows 
that for the general configuration the Fisher information is slighly larger for segments closer to the centre of the 
aperture. 

However, if the Mach-Zehnder method is applied in a telescope with several segments, the signals from 
neighbouring borders have to be well separated. Quite arbitrarily it will be defined that this condition is fulfilled, 
if the distances between the border and, in the case of the sharp-edge pinhole, the first zero, or, in the case of 
the Gaussian pinhole, the x-coordinate where the signal has dropped to 10% of the value at the border, are both 
approximately equal to w/3, as shown by the solid lines in the figures 5 and 6. This requires widths of C = ^/w 
for the sharp-edge pinhole and C, = A/(2w) for the Gaussian pinhole. Figure 8 shows that with these choices of 
the pinhole widths the Fisher information for the single-border configuration is now proportional to the segment 
width w. However, for such widths of the pinholes the Fisher information is much lower than for the widths 
maximising the Fisher information. 

For well separated signals the Fisher information densities in the neighbourhood of a border are independent 
of the segment position. As long as the signals from both borders in a two-border configuration are well within 
the aperture, the Fisher information is therefore also independent of the segment location. 



10 



ijk 



(a) 




- v(i\eko 


/ ; * \ Sharp-edge 


(b) - 


1 
1 
1 
1 
I 
1 


/ \ 

/ Gaussian\ 


\ 
\ 
\ 

\ 

\UnfIltered 




1 









f/fo 



Figure 7: Mach-Zehnder sensor: (a) Normalised Fisher information Jp,Mz/fc^ as a function of the normalised pinhole 
radius (^/^o for pinholes with a sharp edge (solid) and with a Gaussian transmission function (dashed) for the centred single- 
border configuration, (b) Intensities filtered with a sharp-edge pinhole (solid) , and with a Gaussian pinhole (dashed) with 
widths which maximise the Fisher information. The dashed-dotted curve represents the unfiltered diffraction pattern for 
a phase step ^ = 0. 
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Figure 8: Mach-Zehnder sensor: Normalised Fisher information /p.Mz/fc^ as a function of half of the segment width w 
in a single-border configuration with pinhole sizes of \/w for the sharp edge pinhole (solid), and A/(2w;) for the Gaussian 
pinhole (dashed). 

4.3 Foucault knife-edge test 

Differential segment phases can also be measured by a Foucault test. The incoming beam is focussed, a knife-edge 
is placed in the centre of the focal plane, and finally the beam is coUimated and detected in a plane conjugated 
to the pupil. By replacing the knife-edge with a glass prism both halves of the beam can be used for the 
analysis [10] [11] [12]. The computation of the likelihood function and the Fisher information density is outlined in 
appendix A3. Figure 9a shows the likelihood functions for three locations of the segment centre with a segment 
half-width oi w ~ 0.2a and for 9/X = 0.1, and figure 9b the Fisher information densities for 9 = 0. The widths 
of the peaks in the Fisher information densities in the neighbourhood of the borders arc approximately equal 
to the widths of the peaks in the likelihood functions. The ratio of the width of the Fisher information density 
in the neighbourhood of a border to the width 2w of the segment is independent of the other parameters and 
approximately equal to 0.2. The information from adjacent borders can therefore always be well separated. 

A numerical integration shows that for the centred single-border configuration the Fisher information in one 
of the exit pupil equals fc^/4. This may intuitively be expected, since only half of the light enters one of the two 
exit pupils and only half of the image in the focal plane is used to retrieve information. Adding up the information 
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Figure 9: Foucault sensor: (a) Likelihood function for a relative segment phase 6 = A/10, (b) Normahsed Fisher 
information density for a relative segment phase 6 = and w = 0.2a, and, for both plots, three different positions c = Q 
(solid), c = 0.5a (dashed) and c = 0.8a (dashed-dotted) of the segment centre. 

in both channels gives Jf.fc = fc^/2. The Foucault sensor with a glass prism therefore extracts potentially half 
of the quantum statistical information. 

The dashed-dotted line in figure 13a shows the Fisher information as a function of the segment width in 
the single-border configuration. The dependence is quadratic, similar to the case of the quantum statistical 
information. For all segment widths the Fisher information of the Foucault sensor equals half of the quantum 
statistical information. For the general configuration figure 13b shows that the Fisher information is effectively 
independent of the segment location. Figure 9b shows therefore that the Fisher information density is larger in 
the neighbourhood of the single-border in the single-border configuration than in the neighbourhood of the two 
borders in the general configuration. 



4.4 Curvature sensor 

In a curvature sensor the detector is placed at a distance I in front or behind the focal plane, as shown in figure 10. 
Since the diffraction pattern is spread over an interval of approximately 2af /I and the intensities arc therefore 
approximately proportional to 1/ f, the detector coordinates in the figures are normalised by a multiplication with 
f /{la) and the likelihood function and the Fisher information density are normalised by a multiplication with 
1/ f . Except for detector positions very close to the aperture, the diffraction pattern can be calculated with the 
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Figure 10: Curvature sensor: Principle of the curvature wavefront sensor with the detector plane being out-of-focus 
by a distance I with respect to the focal plane of the telescope. 
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Frcsnel approximation, which is outlined in appendix A4. In the hniit I -^ the expressions for the hkeHhood 
functions and for the Fisher information densities converge to the respective expressions for the Shack-Hartmann 
sensor derived in appendix Al. However, the curvature method requires the signals from the two segment borders 
to be clearly separated. In other words, the Fisher information densities corresponding to the two borders should 
not overlap. For w = 0.2a such a separation in the likelihood function as well as in the Fisher information density 
seems to be established for /// > 0.002, as shown in the figures 11a and lib. 
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Figure 11: Curvature sensor: (a) Normalised likelihood function for a defocussing ratio 1/ f — 0.002 and 6 — A/10, (b) 
Normalised Fisher information density for l/f — 0.002, both as functions of the normalised detector coordinate xf/(la). 
For both plots : a = Im, / = 20m, c = 0.3a, and w = 0.2a. 
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Figure 12: Curvature sensor: Normalised Fisher information as a function of the defocussing ratio 1/ f , and a — Im, 
f = 20m, c = 0.3a, and w = 0.2a. 



Figure 12 shows in a log-log-plot the Fisher information as a function of the ratio 1/ f , and for the same 
parameters as in figure 11. The regime where /p.cv is approximately constant is the Fraunhofer regime. Let the 
pure curvature regime be defined as the regime where the signals from two adjacent borders are well separated. 
The ratio l/f = 0.002, which is the lower limit for which a separation is guaranteed, lies at the beginning of the 
regime with a constant slope in the log-log-plot, where /f,cv oc \/f/l and which extends at least up to /// w 0.1. 
With a choice of l/f > 0.002 one is well outside the caustic, that is, the correlation of the subapertures is identical 
with that in the pupil. 

A precise definition of the signal width which guarantees a separation of signals from adjacent borders is 
somewhat arbitrary. Requiring the distance of the first zero of the oscillating pattern to be less than a fraction 
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1/z of the segment half-width w leads to the condition 

I 



f 



> 1 



0.8z2A/ 



(30) 



If, for different widths 2w of the segment, the parameter 1/ f is always chosen such that the condition (30) is 
fulfilled for z — 10, the dependence of the Fisher information on the segment width is linear, as in the case of 
the Shack-Hartmann sensor, that is in the Fraunhofer limit. However, for the chosen non-overlap condition (30) 
the values for the Fisher information are approximately 30 times lower than in the Fraunhofer limit. 

If,cv{(^) is effectively independent of the segment location. This is similar to the case of the Shack-Hartmann 
sensor, except that the central dip in the curve for the Shack-Hartmann sensor in figure 13 is not present in the 
case of the curvature sensor with sufficient defocussing. 



5 Comparison between the four sensors 
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Figure 13: All sensors: Normalised Fisher information I-e/k^ for the single-border configuration as a function of (a) the 
segment width 2w and (b) of the segment location for w — 0.1a. The curve for the Mach-Zehnder was obtained with a 
sharp-edge pinhole with ( — 0.465^o. The curves for the curvature sensor are calculated for a defocussing which guarantees 
that the half width of the segment is approximately ten times larger than the distance from the coordinate corresponding 
to a border to the first zero of the oscillating pattern. 

The Cramer-Rao bounds for the four types of sensors have been discussed in detail in the proceeding sections. 
This section provides a comparison of the relative precision limits which can, in principle, be achieved with these 
sensors for the detection of differential segment phases. Figure 13a shows that for the single-border configuration 
under ideal conditions the performance, defined by the Cramer-Rao bound, of the Shack-Hartmann sensor is 
superior to the performance of the Foucault sensor, which in turn outperforms the Mach-Zchnder sensor for all 
segment widths. For small segment widths the Fisher information of the Shack-Hartmann and the Foucault 
sensors are effectively identical. However, except for the Shack-Hartmann sensor in the centred single-border 
configuration with w = a, none of the sensors attains the quantum statistical limit. 

For the Shack-Hartmann sensor the Fisher information depends linearly on the segment widths. The de- 
pendence is quadratic for the quantum statistical limit, the Foucault sensor, and the Mach-Zchnder sensor with 
pinholes diameters maximising the Fisher information. 

For a Mach-Zehnder sensor with large pinhole diameters, which guarantee that the signals from adjacent 
borders are well separated, the Fisher information is a linear function of the segment width. The maximum value 
which is attained for the centred single-border configuration is approximately 0.03 if the signals from adjacent 
borders are just separated, and decreases with a better separation of the signals. Also the Fisher information of the 
curvature sensor depends, like the Shack-Hartmann sensor as its limiting case for small amounts of defocussing. 
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linearly on the segment width. However, the slope depends strongly on the amount of defocussing, and the 
curve in figure 13 has been calculated for a defocussing which guarantees that the half width of the segment is 
approximately ten times larger than the distance from the coordinate corresponding to a border to the first zero 
of the oscillating pattern. See section 4.4 for further details. 

Figure 13b shows that for all sensors, except for the Mach-Zehnder device, the Fisher information is effectively 
independent of the segment location within the pupil. However, for large pinhole diameters this also applies to 
the Mach-Zehnder sensor. The dip in the curve for the Shack-Hartmann sensor is irrelevant: when such a sensor is 
used in a telescope the subapertures would always be defined by smaller lenslets covering only the neighbourhood 
around a segment border, and the configuration would then be the single-border configuration. 



6 Effects of noise and other parameters 

The effects of broadband illumination, atmospheric disturbances, detector readout noise, and discrete sampling 
of the likelihood function will only be discussed for the Shack-Hartmann sensor in the centred single-border 
configuration. However, the methods used in this section can readily be applied to all other types of sensors and 
configurations. 

6.1 Effects of broadband measurements 
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Figure 14: Shack-Hartmann sensor: (a) Likelihood functions for a phase step d = A/5, (b) normalised Fisher 
information densities for 6 = 0, integrated over different bandpass ratios AA/Ac: Monochromatic (solid), A A/Ac = 0.25 
(dashed), AA/Ac = 0.5 (dashed-dotted). The dotted curve in (a) is the limit to which the likelihood functions converge 
for very large phase steps, that is kO 3> 1, and a bandpass ratio of AA/Ac = 0.5. 

The number of available photons can be increased with broadband measurements, that is by using a larger 
bandpass of the light. The corresponding gain in the information is, however, partially compensated by a loss of 
information due to a loss of form in the likelihood function. As shown below, the Cramer-Rao bound decreases 
with an increase of the bandpass strongly initially small bandpasses, but only slowly at large bandpasses. 

The likelihood function for a broadband measurement is obtained by integrating the monochromatic likelihood 
function over the wavenumber k around a central wavenumber k^ with a suitable prior probability distribution 
or spectral density Pk(fc) as a weight function. In the following calculations Pkik) = const is assumed for any 
bandpass. Figure 14a shows the likelihood function for three bandpasses and a phase step 9 — A/5. Apparently, 
for small phase steps, the position of the maximum of the likelihood function and the width of the central core 
are only weakly affected by the bandpass. 

The dotted line in figure 14 shows the limit to which all curves converge for very large phase steps. For such 
large phase steps, that is for kO ^ \, the sine and cosine functions in equation (A-4) containing kO vary rapidly 
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with k. The terms containing k9 can therefore be replaced by their average values which are 1/2 for the two 
quadratic terms and (1 — cos(fca^))/2 for the mixed term. The dotted curve is then given by 



Pfce»i(0 = 



1 



1 



(7rfca)2 fc2 - fci Jf,, C^ 
where ki and ^2 are the integration limits. 



dfc — [1 — cos(cfc^) cos(fc^a) 



(31) 




Figure 15: Shack-Hartmann sensor: (a) Normalised Fisher information /p/fc^ and (b) normalised Cramer-Rao bound 
ccRB as functions of the ratio of the width A A of the bandpass to the central wavelength Ac. Solid: full information, 
dashed: form-only information. 

Figure 14b shows the curves for the Fisher information densities for the same bandpasses as in figure 14a, 
but for 9 — 0. In the ideal case the peaks in the Fisher information density are, according to equation (1), due 
to the zeroes or small values of the likelihood function at those locations. The peaks are now reduced because 
the zeroes and the small values of the likelihood functions are washed out by the averaging process. The Fisher 
information densities around the second maxima in the curves for 9 — disappear nearly completely for larger 
bandpasses, and the Fisher information density is therefore concentrated closer to the origin. 

Figure 15a shows that, due to the weak dependences of the form of the core and the position of the maximum 
on the bandpass, the Fisher information per photon as a function of the bandpass decreases only weakly with 
the bandpass. This is true for the full information (solid line) as well as for the form-only information (dashed 
line). See section 4.1 for the definition of form. The decrease of the Fisher information per photon with an 
increase of the bandpass is slower than the proportional increase with the number of photons. Therefore the 
Cramer-Rao bound ctcrb of the error in the estimate of the phase step decreases continuously with an increase 
of the bandpass. This can also be seen in figure 15b which shows ctcrb in units of X/(2tt) for one photon per 
nanometer of the bandpass. 



6.2 Effects of atmospheric turbulence 



Turbulence and temperature inhomogeneities in the atmosphere lead to a non-planarity of the incoming wavefront. 
These disturbances reduce the Fisher information, and consequently reduce the precision with which the phase 
step can be estimated. 

The effects of the atmosphere can be taken into account by writing the complex amplitude U{x) in equa- 
tion (A-2) as a product of the complex amplitude Us{x) related to the relative segment phase and a stochastic 
function C/atm(a;) describing the disturbance introduced by the atmosphere. The likelihood function in equa- 
tion (A-3) can then be written as 



pi^\9) = -TTa 



dxp 



+a-x„ 



dxp U*{xp')Us{xp' + Xp) < U*t^{xp' + Xp)U^tm{xp') > e*'=«='p 



(32) 
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Figure 16: Shack-Hartmann sensor: (a) Likelihood functions for a phase step 9 — O.IA under different seeing con- 
ditions, (b) corresponding normalised Fisher information densities for ^ = 0. Seeing conditions: Dotted: no atmosphere, 
Dashed- dotted: ro/2a — 2.25 , Dashed: ro/2a = 1.5 , Solid: ro/2a = 0.75 . 

The expression in the brackets is the atmospheric structure function. For Kolmogorov turbulence and neglecting 
the effects of the inner and outer scales it is given by 



< C/atm(a;p' + a;p)C/atm(a;p') >= exp 



-3.44 ( ^ 
^0 



5/3- 



(33) 



where ro is the atmospheric coherence length. For a given wavelength it can be shown that the likelihood function 
depends on ^ and ^0 only through the ratio ^/^o and on ro and a only through the ratio ro/a. 

Figure 16a shows for a phase step of 6* = O.IA the influence of the atmosphere on the likelihood function 
for four different seeing conditions, expressed as ratios ro/(2a). As in the case of the integration over a finite 
bandpass the location of the maximum does not depend on the seeing conditions. However, the second maxima 
are washed out and the width of the central core increases strongly with deterioration of the seeing. 

The Fisher information densities for 9 = Q and four different seeing conditions are shown in figure 16b. As 
in the case of larger bandpasses and for similar reasons as given in the corresponding section 6.1, under the 
influence of the atmosphere the Fisher information density is more concentrated near the origin than the Fisher 
information density under ideal conditions. 




Figure 17: Shack-Hartmann sensor: (a) Normalised Fisher information /p/fc^ and (b) normalised Cramer-Rao bound 
ccRB as functions of the ratio of the aperture width 2o to the atmospheric coherence length ro. 

For a design of the wave front sensor one would usually assume a given seeing, that is a given value for ro, and 
then define the width of the lens. Therefore, figure 17a shows the normalised Fisher information as a function 
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of the ratio 2a/rQ, the full information as the solid line and the form-only information as the dashed line. The 
strong dependence of the width of the core on the seeing, mentioned above, also explains the strong decrease of 
the Fisher information with an increase of the ratio 2a /tq. If the diameter of the aperture is equal to rg the 
Fisher information is already reduced by a factor of approximately 10 compared to the Fisher information under 
ideal conditions. 

On the one hand the increase of the width of the lens increases the Fisher information due to the larger 
number of photons. On the other hand, the Fisher information per photon decreases rapidly with the size of the 
lens. The combined effect is given by the product of the Fisher information per photon times the width of the 
lens. The corresponding curves for the Cramer-Rao bounds in units of A/(27r) are plotted in figure 17b for a flux 
density of one photon per millimeter of the subaperture. Contrary to the situation for broadband measurements, 
the Cramer-Rao bound has a minimum, which is attained for a ratio 2a/ro ~ 0.4. 




Figure 18: Shack-Hartmann sensor: (a) Normalised Fisher information as a function of the readout flux in units of 
the flux from the sky source, (b) Normalised Fisher information as a function of the pixel hafl-width 5. 



6.2.1 Effects of detector readout noise 

The effect of the readout noise of the detector can be modelled by adding a constant background Cbg to the 



likelihood function in equation (A-4): 






-P^{i\o) 



(34) 



Here po(C | 0) is the likelihood function for one photon without background noise. The value of the constant Cbg 
is the ratio of the constant background to the peak in the ideal likelihood function for 9 = Q. 

Figure 18a shows for a narrowband measurement the full Fisher information (solid lines) and the form-only 
Fisher information (dashed lines) as functions of Cbg. The full Fisher information declines initially much faster 
than the form-only Fisher information. The reason for this is that the zeroes in the likelihood function, which 
are, according to equation (1), responsible for the strong peaks in the full Fisher information density, disappear 
because of the constant background. Since, for large backgrounds, the ratio of the full to the form-only Fisher 
information only equals approximately two, the full Fisher information is also more strongly affected than the 
form-only Fisher information for large backgrounds. Regarding figure 18b, see next section. 



6.2.2 Effects of the detector pixelization 

The effect of finite pixel sizes 5 can be modelled by a product of a comb function, describing the positions of the 
centres of the pixels, and a continuous likelihood function which is obtained by averaging, for each ^, the initial 
likelihood function over the interval [^ — (5/2,^ -t- (5/2]. Figure 19a shows the likelihood functions for (5 ^ (solid). 
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Figure 19: Shack-Hartmann sensor: (a) Likelihood functions for pixel half-width of 5 -^ (solid), 5 = 0.5^o (dashed), 
<5 = Co (dashed-dotted), and 5 = 1.5^0 (dotted), (b) Corresponding Fisher information densities. 



(5 — 0.5^0 (dashed), (5 = ^o (dashed-dotted), and J = 1.5^o (dotted). The corresponding Fisher information 
densities are shown in figure 19b for the full information. The strong decline of the Fisher information densities 
with the pixel size is also apparent from figure 18b, which shows that the decrease is linear for small pixel widths. 
The effect of the finite pixels size is similar for the full Fisher information and the form-only Fisher information. 



7 Conclusions 



This paper has shown that the quantum statistical information relating to a relative segment phase depends 
quadratically on the ratio of the segment width to the width of the aperture. The maximum, given by the square 
of the wavenumber per photon passing through the aperture, is attained for the case where the segment covers 
half of the aperture. In this special case the limiting precision for the measurement of the relative segment phase 
error is equal A/(27r), where A is the wavelength of the light. Under ideal conditions, that is with narrowband 
measurements and negligible atmospheric or detector noise, both the Shack-Hartmann and the Foucault sensor 
can, for small ratios of the segment size to the diameter of the full aperture, potentially extract approximately 
half of the ultimate quantum statistical information. For the special case that the aperture is only filled by 
two segments with the phase step at the centre of the aperture, the Shack-Hartmann device even reaches the 
ultimate precision, and the precision obtained by the Foucault sensor is smaller by a factor of \/2- The other 
two sensors, Mach-Zehnder and curvature, require in general approximately ten to twenty times the number 
of photons in order to reach the same precision as the first two sensors. In the case of the Shack-Hartmann 
sensor, broadband measurements always increase the precision, whereas the optimum width for the subaperture 
corresponding to one lenslet in the presence of atmospheric disturbances is approximately equal to half of the 
atmospheric coherence length. Also detector readout noise and the effects of a finite sampling strongly reduce 
the precision. The effects of these disturbances on the other sensors may be different from the effects on the 
Shack-Hartmann sensor and remain to be investigated. 
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Appendix A : Expressions for the likelihood function and Fisher in- 
formation density for the four sensors 

In this appendix the expressions for the likchhood functions and Fisher information densities for the four sensors 
discussed in the main text are derived. 



Al: Shack-Hartmann sensor 

For monochromatic hght with wavelength A the Fraunhofcr diffraction pattern in the focal plane of an infinitely 
long cylindrical lens covering a one-dimensional subaperture of width la on the mirror can be calculated with a 
Fourier transformation. The diffraction pattern may be viewed as a likelihood function p(^ | ff) for the detection 
of a photon at a coordinate ^ in the focal plane. ^ is a unitless position variable, related to a true position variable 
X in the focal or detector plane by ^ = a;//, where / is the focal length of the lens. If x-^ and x-^ are coordinates 
perpendicular to the axis of the cylindrical lens, and \J the complex field in the plane of the cylindrical lens, the 
likelihood function is given by 

P(CI^) = 1^ <y ''dxpt/*(a;p")e-*'=«^p" y "dVC/(V)e*^"'''> (A-1) 

= l^y "dv/ "dxp <C/*(xp")C/(xp')>e''=«(--'---") . (A-2) 

The bracket denotes a statistical average over a possibly stochastic ensemble of complex amplitudes U . In this 
paper such a stochastic ensemble is used for the treatment of atmospheric effects in section 6.2. 

After the substitution Xp = x^' — x-p" one gets 

P{^\d) = ^ I Axp' I ' dxp < C/*(xp')(7(xp' + xp) > e^'^?"- . (A-3) 

^^ J -a J-a-Xp 

Next, the non-stochastic wave function Us related to the relative segment phase as defined in equation (9), is 
introduced for U in equation (A-3). With ^o = A/(2a) as the location of the first zero in the diffraction pattern 
for 9 — one gets 

w 



P(CI^) = f rrrrr? sin^^/^o) + 2sin2(7r-e/eo)(l -cos(2^0/A)) 



1 1 

fo R7io)2 

w / c c \ 

+2sinK/eo)sin(^-e/eo)(cos(^-^/^o-27r0/A)-cos(-^/^o) • (A-4) 

a \ a a / i 



The Fisher information can easily be computed by introducing the amplitude q{^\0), calculated from equation (A- 
4) following equation (4), into equation (8). The lengthy expression for arbitrary values of c, w, and 6 is not 
quoted here. For the special case oi 9 ^ one obtains the following much shorter expression: 

*f(CI^^0) = 4^— ^sin2(7r-C/eo)sin2(^%eo) • (A-5) 



'r2 6(e/&)2 ^ a 



A2: Mach-Zehnder sensor 



The amplitude Ui in the first, unmodified beam in figure 4 is given by equation (9). After the light in the second 
beam passed through the phase plate and the spatial filter, the amplitude U2 is given by the convolution 

1 f^ 

U2{x) = —= Ax'Ui{x')T{x-x') , (A-6) 

V A j-00 

20 



where T{x) is the inverse Fourier transform of the transmission function (29) of the pinhole. Since the detectors 
are in image planes of the pupil, the same coordinates can be used as for the pupil, that is a; = a;p. The likelihood 
functions on the two detectors are then 

PiA^ \0)^l iPii^ I ^)l' + \U2{x I 9)\^ ± 2Re[C/i*(x | 9)U2{x \ 9)e'^]} , (A-7) 

where the plus sign refers to pi and the minus sign to p2. With the special choice of the phase shift oi (/) = tt/2 
and the abbreviations 

b = fcC (A-8) 

*L = (^(-b{a + x),b{c-w-x)) (A-9) 

*M = ^(b(c-w-x),b(c + w-x)) (A-10) 

*H = <Pib{c + w-x),b{a-x)) (A-11) 



$(u, w) 



{2/tt) J^ dz sin z / z sharp edge pinhole 1 / a i o^ 

(2/^7?) /J dz exp(— z^) Gaussian pinhole J ' 



one obtains for the two likelihood functions 

PiA^ld) = -^{W^(x,a,-a) + i[(*L + *H)' + *|i + 2cos(fc0)(*L + *H)*M] 

± sin(fc6') [W{x, c-w,c + w)(*l + *h) 

-{W{x, -a, c-w) + W{x, c + w, a))*M]} (A-13) 

One of the assumptions for the derivation of equation (2) was that the integrated intensity is independent of 9. 
In the case of the Mach-Zehnder interferometer this is strictly only true for 9^0. For both types of pinholes 
the integrated intensity is a symmetric function of 9 with, according to numerical calculations, a quasi parabolic 
behaviour around 9 — 0. Around the maximum at 9 = the derivative of the integrated intensity with respect 
to 9 is therefore zero. 

The Fisher information is identical in the two beams behind the second beamsplitter. With the definition 

*2 = *2 + M/2j + vl/| , (A-14) 

the total Fisher information as a sum of the Fisher information from both beams behind the second beamsplitter 
is, in the limit 9^0, given by 

/f,mz = 7- [ / dx ^IJ (1 + M/ V4) + / dx {i^l + *|)2 - vl/2j) / (1 + M/2/4) ] . (A-15) 

A3: Foucault sensor 

The Fourier transform T{x) of a Heaviside step function representing the effect of the knife edge in the focal 
plane is given by 

Tix)^^(^±^ + n6ix)^ , (A-16) 

where the signs + and — refer to the two channels with the transmission equal to zero for ^ < and ^ > 0, 
respectively. Introducing equation (A-16) into equation (A-6) gives with the definitions 

{c + w — x)'^ 
{c ~ w — xY 
{a-xf 
[a + a;)2 

the expression for the likelihood function on the two detectors 

±sivL{k9)'K{W{x,-a,a)Y2-W{x,c-w,c + w)Yi)} . (A-19) 



^1 = in )r:. :;, (a-i?) 

Y2 = 1^7^—^ (A-18) 



|C/2,Fc(2;|^)P = ^-^{-Y,^ + 7r^Wix,^a,a) + -{l-cos{k9))Y2{Y2-Yi) 
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The inversion of the sign in front of the sine-term can be understood from the fact that the intensity distribution 
must be symmetric with respect to a change of the transmission from ^ < to ^ > and a simultaneous change 
of the sign of the phase step 9. A similar expression for the centred single-border configuration has been derived 

in [12]. 

For one channel alone the requirement for the integrated intensity to be independent of 9 is not fulfilled, since 
the centre of gravity of the diffraction pattern in the focal plane depends linearly on 9 around = 0, as shown by 
equation (26). However, the requirement is obviously fulfilled for the sum of the integrated intensities in the two 
channels. Therefore, the formalism has to use the sum of the Fisher information densities in the two channels, 
which are, however, in the limit 0^0, independent of the channel. For 0^0 the sum of the Fisher information 
in the two channels is given by 

/f,fc = ^/" {Y2-W{x,c-w,c-w)Yif / {ti^ + Y^/A) (A-20) 

A4: Curvature sensor 

The one-dimensional equivalent of the two-dimensional expression for the complex amplitude in the plane of the 
detector given in [5] [7] is 

1 „ /■+« „ „ 

(A-21) 

(A-22) 
(A-23) 

(A-24) 

dx\Ucy{x\9)\'^ ^l (A-25) 

Introducing Us from equation (9) for U in equation (A-24) one obtains with the abbreviations 

Ci = C((-a-fx/l)/p) Si = S{{-a-fx/l)/p 

C2 = C{{c-w-fx/l)/p) S2 = S{{c-w-,fx/l)/p 

Cg = C{{c + w-Jx/l)/p) S3 = Si{c + w-fx/l)/p , (A-26) 

Ci = C{{a-fx/l)/p) Si = S{{a-fx/l)/p 

where C and S denote the Fresnel integrals, the following expression for the likelihood function: 

\Ucv{x \e)\' - ^ {cl + sl, + CI2 + ^3^2 + cl + sl 

+ 2 COs(fc0) [C21C32 + ^21 ^32 + C32C43 + ^32^43] 

+2 sm{ke) [S12C32 ' C21S32 + 532C43 - C32C43] 

+2C21C43 + 2^21^43) • (A-27) 

The corresponding expression for the Fisher information density can be readily calculated from the equations (5), 
and (A-27) and is not presented here. 
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(A-21) can be written as 


















Ucw{x\e) = ^y 
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The expression for Uqv has been normalised such that 
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